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SUMMARY : A regulator for linear plants is proposed whose mode~of operation 

allows plant perturbations to go unchecked as long as they are within some 
bounding hyper surface in perturbed state space. This bounding hypersurface 
represents some limit on acceptable plant perturbations. When a perturbed 
state attains the hyper surf ace a linear feedback control is initiated, 
optimally driving the state back toward the origin for a fixed amount of 
time with a fixed amount of control energy. At control shutdown there 
exists some residual perturbation and the mode of operation continues. 

A conventional solution for the optimal control for each operation 
leads to an equation for an undetermined multiplier which depends on the 
initial phase of the plant. With the above mode of operation the initial 
phase is not known a priori so that an on-line solution to the multiplier 
equation is necessary to specify the control prior to each operation. It 
is shown in this paper how this requirement may be avoided by interpreting 
the multiplier equation geometrically as a family of hypersurfaces in state 
space. By utilizing one of these hypersurfaces as the bounding hypersurface 
required in the mode of operation, the optimal control law may be specified 
before the regulator is put into service. The results of the regulator 
applied to a linear oscillator are given to illustrate its mode of 
operation. 


PROBLEM STATEMENT 

The system considered may be written 
in vector -matrix notation 

x = F(t)x + G(t)u ^ (1) 

where xeR 11 is the plant perturbation 
from equilibrium and ueEP is the con- 
trol. The system is assumed to be com- 
pletely controllable . 1 

The regulator 1 s mode of operation 
allows plant perturbations due to dis- 
turbances and possibly plant instability 
with no control action as long as they 
are within some bounding hypersurface in 
perturbed state space. This bounding 
hypersurface represents the limiting 
value of acceptable plant perturbations. 
When a perturbation attains this hyper- 
surface, say at time to, the control 


is initiated and operates for a fixed 
amount of time, tq - t G , to drive the 
perturbed state back toward the origin. 

At tq the control is shut down and the 
regulator’s cycle repeats. This paper 
deals with the control and the bounding 
hypersurface which determines the initial 
phase (xq, t Q ) for each operation. 

The control is to be designed so that 
for each operation it (i) mini m izes the 
error criterion 


J * |||x(ti) 


1 />> 


Q(t) 


at 


(2) 


where A and Q(t) are finite symmetric 
nonnegative definite matrices at least 
one of which is positive definite and 

Q(t)eC^ for all te [tctj and (ii) 
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R(t) dt 


(3) 


where R(t) is a symmetric positive 
definite matrix and R(t)eC 2 for all 
te(t°,t£] . The design admits plant per-’ 

turbations at shutdown and the control 
energy, E, may therefore be specified 
less than the minimum energy required to 
get to the origin. This procedure 
avoids an unbounded control and other 
associated problems. The minimum 
energy, E°, is given by 


E° = | x 0 




( 4 ) 


where W(t 0 ,ti) is the controllability 

matrix. 1 The bounding hyper surf ace 
must, therefore, be such that (i) it 
lies within the hypersurface defined by 
the equation 


E 


1 

2 



2 

w -1 (t 0 ,ti) 


(5) 


and that (ii) all perturbations within 
and on the bounding hypersurface are 
acceptable. Requirement (ii) depends 
on the accuracy with which the linear 
equations represent the physical system 
as well as the designers needs. 


As in reference 2 the conjugate variable 
is denoted |eR n and the Hamiltonian is 
written 


H(x,|,u,t) 



+ 2| • [V(t)x + G(t)uj 


+ A 




(7) 


The Hamiltonian is then minimized with 
respect to u to give as the optimal 
control function 


u°(t) = - i R -1 (t)G'(t)6(t) (8) 

A 

where | is related to the system 
through the canonic equations 


x(t) 


F(t) 

- i R -1 (t)G' (t) 


x(t) 

eC-t) 


-Q(t) 

-F'(t) 


l(t) 


(9) 


This set of equations have the boundary 

p 

conditions 


x(t Q ) = x 0 (10) 


SOLUTION FOR THE CONTROL 


lUp) = Ax(ti) (ll) 


The first step is to adjoin (3) to 
(2) with a constant positive real 
Lagrange multiplier to give as the 
adjoined criterion 


V 


1 



+ A 


u(t) 


2 



dt 


x(t) 


2 

Q(t) 

( 6 ) 


To solve for |(t) 
set of solutions to 


let 

(9) 


such that 


be a 


X(ti,ti) = I (12) 

= A (13) 

Then by linearity 

x(t) = X(t,ti)x(t!) 


1 

J 


R(t) 




I 


|(t) = ^(t,tl)x(tl) 


(15) where 



In reference 2 X(t,tq) is shown to he 
the fundamental matrix of the optimally 
controlled plant. Then with the energy 
level, E, chosen such that x(tq) / 0 , 

X" 1 (t,tp) exists for all tejt 0 ,tfj 
and |(t) may he written 

!(t) = (16) 

Equations (l6) and (8) give for the 
optimal control 

K°(t,x)=-i R- 1 (t)G*(t)H(t,ti)X- 1 (t,ti)x 

A 

(17) 

vhere K°(t,x) replaces u°(t) to 
denote optimal feedback control. A has 
yet to he determined; however, we may 
examine its relation to the specific 
energy level, E, the fixed time, 
ti - t Q , and the system dynamics. 

Substitution of ( 15 ) into (8) gives 
the control in terms of the terminal 
state. This then may he substituted 
into (3) to give as the constraint 
equation 

I 2 

E = x(t!) ( 18 ) 

I D 

where 

D = f 1 H*(t,ti)G(t)R-l(t) 

&T J t Q 

X G'(t)S(t,t 1 )dLt ( 19 ) 

Then by utilizing ( Ik ) in (l8), the 
final form of the energy constraint 
equation is 

2 

E = x c ( 20 ) 

B 


B = X'l’CtojtJlX-^tojti) ( 21 ) 

Now the standard solution to the 
problem would require the solution of 
(20) for a positive real multiplier 
prior to each operation since from (20) 
and (21) it is clear that, in general, 

A Is a function of the initial phase 
(x 0 ,t 0 ). To this end an analog method 
for continuously tracking the multiplier 
as a function of the phase for the case 
when Q(t) ~ 0 is essentially given in 
reference 3* A method will now be shown 
whereby the on-line tracking or computing 
of A can be avoided by employing the 
aforementioned concept of a bounding 
hypersurf ace . 

THE BOUNDING HYPERSURFACE 

The regulator's mode of operation 
requires that some activation criterion 
be used to begin each control operation. 
In view of this, it is noted that for 
each t G and with A as a parameter, 
(20) defines a family of hypersurfaces 
in state space from which the optimal 
control associated with A will use E 
amount of energy. If, for each t Q , 
there is a member of this family suitable 
for a bounding hypersurf ace , then by 
using these hypersurfaces to define a 
bounding hypersurface varying with t Q , 
the on-line computation of A(t 0 ) may 
be avoided. 

The method of using this activation 
criterion is to feed back the state with 
time varying gains to form 

2 

e = Xq (22) 

B(A(t 0 ),t 0 ) 

As long as e is less than E the con- 
trol remains off, but when e equals E 
the control Is initiated and the optimal 
feedback control law associated with the 
A(t 0 ) is used. Of course, for time 
constant systems there will only be one 
A so the bounding hyper surf ace will not 
be time varying. 



(25) 


It should he pointed out that (l8) 
is a direct indication of the effective- 
ness of the system design since for 
each t 0 it defines the hyper surf ace 
to which the control will drive the 
state. 


A SIMPLE EXAMPLE 

In the interest of simplicity, the 
example plant Is a constant undamped 
harmonic oscillator. In spite of its 
simplicity, it illustrates the regula- 
tor^ usefulness on the class of con- 
stant neutrally stable systems subject 
to small disturbances. The symmetric 
spinning space vehicle is a member of 
this class in which considerable 
interest has been shown. ^>5 >6, 7 


J = + x|(T)j 

This norm is constant for the free plant 
and is a good measure of the system per- 
turbation from equilibrium. Typically, 
neutrally stable systems have constants 
of free motion which may be expressed as 
quadratic forms in the state variables. 
One of these forms is often useful as an 
error criterion. 

The energy constraint is chosen as 
half the integral square energy by 
letting R = 1 which gives 



The vector-matrix equation is 
written 


X 1 

x 2 


0 1 

-1 0 


X 1 

x 2 


u ( 23 ) 


where x^ is the displacement and x 2 

is the velocity. The system is required 
to run from t Q = 0 to tq = T. The 
controllability matrix is then computed 
with the result 


W(o,T) = | 


£ - i sin 2T 


i - i cos 2T 


i- - — cos 2T 


T 1 

- + i sin 2T 


This completes the problem statement. 
Use of equation ( 9 ) results in the 
canonic differential equations 


X 1 

x 2 

ii 

^2 


0 10 
-10 0 

0 0 0 
0 0-1 


X 1 

x 2 


^2 


(27) 


The solution of (27) with the appropriate 
terminal conditions results in 


(28) 


\k 4 

2 4 J 

(2k) 

H(t,T) = 

cos(T 

- t) 

-sin(T 

- 1) 

W(o,T) may be shown to 

be positive 


sin(T 

- t) 

cos(T 

- 1) 

definite for all T -f 0 

so the system 







is controllable. 


To further simplify the problem, 
only the terminal error will be mini- 
mized and the A matrix will be the 
identity matrix. This makes the error 
criterion circular in nature 


-t 


and 


X _1 (t,T) = i 


(2A+T-t)cos(T-t)+ sin(T-t) 
2A 


-(2A+T-t) 

2X 


(2A+T-t) 

2A 


sin(T-t) 


sin(T-t) 


cos(T-t) - -i- sin(T-t) 

2A ' 2A 


(29) 


where 


r 

-| 2 r 1 

L 1 + x< t - * 

)J . ^ sin(T - t)J 


(30) 


Direct substitution of ( 28 ) and (29) 
into ( 17 ) and ( 19 ) results in the 
optimal feedback control law 


K ° (t ’ x) ‘ s(a sl ° 2(T • t) ) Xl 

+ - t) + 1 


- sin(T - t )cos(T - 

(31) 

and the D matrix 


T sin 2T sin% 

2 4 2 


computed and the activation hypersurface 
determined. 

From X -1 (o,T) and D, it is evident 
that with T an even multiple of 11 , B 
defines a circular hypersurf ace . This 
indicates that, in general, the time of 
operation may be a vital parameter in 
shaping the B-hypersurface. In the 
example, however, a more general case has 
been chosen by setting T = 10. Since 
the period of the oscillator's free 
motion is 2n , the control is therefore 
in operation for more than one period. A 
displacement of 5 and a zero velocity was 
chosen rather arbitrarily to be on the 
B-hypersurface . By using this point and 
(24) in equation (4), the minimum energy 
is found to be 2.62 units. The energy in 
this problem was chosen as 2.5 units to 
be within this limit. Equation (20) was 
then solved for A. Its value was 0 . 115 . 
The resulting B-hypersurface and 
D-hypersurface which are slightly ellip- 
tical are shown in figure 1. A typical 
trajectory during a control operation is 
shown between the two hyper surf a ce s . The 
time history of this trajectory and its 
associated control are shown in figure 2. 



sin^T 

2 


(32) 


T + sin 2T 

2 k 


With the substitution of ( 29 ) and ( 32 ) 
into (21), the B matrix is determined. 
E, T, and some point desired to lie on 
the control activation hyper surface may 
now be chosen so that A may be 


CONCLUSIONS 

A special family of hypersurfaces in 
state space has been identified with the 
optimal linear regulator. It has been 
shown that on-line computation of the 
multiplier associated with the control 
constraint may be eliminated by choosing 
a member of this special family as the 
regulator's activation criterion. This 


results in a relatively simple optimal 
control system suitable for repetitive 
operation. 
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Figure 1.- Hypersurface geometry and a typical optimal trajectory. 




Figure 2.- Typical trajectory and control time history. 
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